PeRSSonal's core functionality evaluation: Enhancing text labeling through personalized summaries

نویسندگان

  • Christos Bouras
  • Vassilis Poulopoulos
  • Vassilis Tsogkas
چکیده

In this manuscript we present the summarization and categorization subsystems of a complete mechanism that begins with web-page fetching and concludes with representation of the collected data to the end users through a personalized portal. The system intends to collect articles from major news portals and, following an algorithmic procedure, to create a more user friendly and personalized “view” of the articles. Before presenting the information back to the end user, the core of our mechanism automatically categorizes data and then extracts personalized summaries. We focalize to the core of the mechanism and more specifically, we present the algorithms used for the summarization and the categorization of texts. The algorithms are not utilized only for producing isolated data, targeted to a specific subsystem, but a combination of the algorithms, which achieves co-operation of the categorization and summarization mechanisms, is introduced in order to enhance text labeling through the personalized summaries that are constructed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Personalizing Text Summarization Based on Sentence Weighting

The amount of data that exists on the Internet is enormous enough to distract users when trying to find useful information. In addition, the expansive use of small screen devices for browsing the World Wide Web generates huge problems when trying to find and read information. A solution to these problems is to personalize the web and try to reduce algorithmically the amount of text. Many text s...

متن کامل

COMPENDIUM: a text summarisation tool for generating summaries of multiple purposes, domains, and genres

In this paper, we present a Text Summarisation tool, compendium, capable of generating the most common types of summaries. Regarding the input, single-and multi-document summaries can be produced; as the output, the summaries can be extractive or abstractive-oriented; and finally, concerning their purpose, the summaries can be generic, query-focused, or sentiment-based. The proposed architectur...

متن کامل

Berkovsky, Shlomo, Timothy Baldwin and Ingrid Zukerman (2008) Aspect-Based Personalized Text Summarization, In Proceedings of the 5th International Conference on Adaptive Hypermedia and Adaptive Web-Based Systems, Hanover, Germany, pp. 267-270

This work investigates user attitudes towards personalized summaries generated from a coarse-grained user model based on document aspects. We explore user preferences for summaries at differing degrees of fit with their stated interests, the impact of length on user ratings, and the faithfulness of personalized and general summaries.

متن کامل

Gleaning Information Better: Enhancing Retrieval Performance of Web Search Engines Using Postprocessing Filters

There have been several approaches to improving the eeciency of text-based Information Retrieval systems and more recently, Web search engines. Many of them have concentrated on improving the algorithms used to index or retrieve text material. In this paper, we propose a exible framework for enhancing the precision of retrieval, where a core IR system is supplemented with postprocessing in the ...

متن کامل

Automatic Labeling of Topic Models Using Text Summaries

Labeling topics learned by topic models is a challenging problem. Previous studies have used words, phrases and images to label topics. In this paper, we propose to use text summaries for topic labeling. Several sentences are extracted from the most related documents to form the summary for each topic. In order to obtain summaries with both high relevance, coverage and discrimination for all th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Data Knowl. Eng.

دوره 64  شماره 

صفحات  -

تاریخ انتشار 2008